Controlled English Advantages for Translated and Original English Documents
Abstract
The Controlled English we have developed for Caterpillar is designed to reduce translation costs by facilitating machine translation. It is also preferred by English-speaking readers of the resulting documents. The approach to machine translation and the resulting design of the Controlled English were optimized to Caterpillar's environment and requirements, including a very high document and translation volume. The basic framework of the Caterpillar solution can be adapted to other, more typical, sets of documentation requirements. We describe an adaptation we are making for Diebold, based on machine-assisted human translation.
Similar resources
Title Generation for Machine-Translated Documents
In this paper, we present and compare automatically generated titles for machine-translated documents using several different statistics-based methods. A Naïve Bayesian, a K-Nearest Neighbour, a TF-IDF and an iterative Expectation-Maximization method for title generation were applied to 1000 original English news documents and again to the same documents translated from English into Portuguese,...
Optimal Bilingual Data for French–English PB-SMT
We investigate the impact of the original source language (SL) on French–English PB-SMT. We train four configurations of a state-of-the-art PB-SMT system based on French–English parallel corpora which differ in terms of the original SL, and conduct experiments in both translation directions. We see that data containing original French and English translated from French is optimal when building ...
How to Verify Plagiarism of the Paper Written in Macedonian and Translated in Foreign Language?
AIM: The aim of this study was to show how to verify plagiarism of the paper written in Macedonian and translated in foreign language. MATERIAL AND METHODS: Original article "Ethics in Medical Research Involving Human Subjects", written in Macedonian, was submitted as an assay-2 for the subject Ethics and published by Ilina Stefanovska, PhD candidate from the Iustinianus Primus Faculty of Law, ...
Implicit Ambiguity Resolution Using Incremental Clustering in Korean-to-English Cross-Language Information Retrieval
This paper presents a method to implicitly resolve ambiguities using dynamic incremental clustering in Korean-to-English cross-language information retrieval. In the framework we propose, a query in Korean is first translated into English by looking up Korean-English dictionary, then documents are retrieved based on the vector space retrieval for the translated query terms. For the top-ranked r...
Document Similarity in Repeatedly Translated Corpora
Preliminary communication. The paper analyses the changes in relationship between documents in textual corpus that occur due to the translation into another language. Authors analyzed the similarities between documents in original corpus, in Croatian, and compared them with the corresponding documents in translated corpus, in English. The changes were analyzed using two measures, chi-square test...